Crank–Nicolson method

In numerical analysis, the Crank–Nicolson method is a finite difference method used for numerically solving the heat equation and similar partial differential equations.[1] It is a second-order method in time, implicit in time, and is numerically stable. The method was developed by John Crank and Phyllis Nicolson in the mid 20th century.[2]

For diffusion equations (and many other equations), it can be shown the Crank–Nicolson method is unconditionally stable.[3] However, the approximate solutions can still contain (decaying) spurious oscillations if the ratio of time step to the square of space step is large (typically larger than 1/2). For this reason, whenever large time steps or high spatial resolution is necessary, the less accurate backward Euler method is often used, which is both stable and immune to oscillations.

Contents

The method

The Crank–Nicolson method is based on central difference in space, and the trapezoidal rule in time, giving second-order convergence in time. For example, in one dimension, if the partial differential equation is

\frac{\partial u}{\partial t} = F\left(u, x, t, \frac{\partial u}{\partial x}, \frac{\partial^2 u}{\partial x^2}\right)

then, letting u(i \Delta x, n \Delta t) = u_{i}^{n}\,, the equation for Crank–Nicolson method is a combination of the forward Euler method at n and the backward Euler method at n + 1 (note, however, that the method itself is not simply the average of those two methods, as the equation has an implicit dependence on the solution):

\frac{u_{i}^{n %2B 1} - u_{i}^{n}}{\Delta t} = 
F_{i}^{n}\left(u, x, t, \frac{\partial u}{\partial x}, \frac{\partial^2 u}{\partial x^2}\right) \qquad \mbox{(forward Euler)}
\frac{u_{i}^{n %2B 1} - u_{i}^{n}}{\Delta t} = 
F_{i}^{n %2B 1}\left(u, x, t, \frac{\partial u}{\partial x}, \frac{\partial^2 u}{\partial x^2}\right) \qquad \mbox{(backward Euler)}
\frac{u_{i}^{n %2B 1} - u_{i}^{n}}{\Delta t} = 
\frac{1}{2}\left(
F_{i}^{n %2B 1}\left(u, x, t, \frac{\partial u}{\partial x}, \frac{\partial^2 u}{\partial x^2}\right) %2B 
F_{i}^{n}\left(u, x, t, \frac{\partial u}{\partial x}, \frac{\partial^2 u}{\partial x^2}\right)
\right) \qquad \mbox{(Crank-Nicolson)}

The function F must be discretized spatially with a central difference.

Note that this is an implicit method: to get the "next" value of u in time, a system of algebraic equations must be solved. If the partial differential equation is nonlinear, the discretization will also be nonlinear so that advancing in time will involve the solution of a system of nonlinear algebraic equations, though linearizations are possible. In many problems, especially linear diffusion, the algebraic problem is tridiagonal and may be efficiently solved with the tridiagonal matrix algorithm, which gives a fast \mathcal{O}(n) direct solution as opposed to the usual \mathcal{O}(n^3) for a full matrix.

Example: 1D diffusion

The Crank–Nicolson method is often applied to diffusion problems. As an example, for linear diffusion,

{\partial u \over \partial t} = a \frac{\partial^2 u}{\partial x^2}

whose Crank–Nicolson discretization is then:

\frac{u_{i}^{n %2B 1} - u_{i}^{n}}{\Delta t} = \frac{a}{2 (\Delta x)^2}\left(
(u_{i %2B 1}^{n %2B 1} - 2 u_{i}^{n %2B 1} %2B u_{i - 1}^{n %2B 1}) %2B 
(u_{i %2B 1}^{n} - 2 u_{i}^{n} %2B u_{i - 1}^{n})
\right)

or, letting r = \frac{a \Delta t}{2 (\Delta x)^2}:

-r u_{i %2B 1}^{n %2B 1} %2B (1 %2B 2 r)u_{i}^{n %2B 1} - r u_{i - 1}^{n %2B 1} = r u_{i %2B 1}^{n} %2B (1 - 2 r)u_{i}^{n} %2B r u_{i - 1}^{n}\,

which is a tridiagonal problem, so that u_{i}^{n %2B 1}\, may be efficiently solved by using the tridiagonal matrix algorithm in favor of a much more costly matrix inversion.

A quasilinear equation, such as (this is a minimalistic example and not general)

\frac{\partial u}{\partial t} = a(u) \frac{\partial^2 u}{\partial x^2}

would lead to a nonlinear system of algebraic equations which could not be easily solved as above; however, it is possible in some cases to linearize the problem by using the old value for a, that is a_{i}^{n}(u)\, instead of a_{i}^{n %2B 1}(u)\,. Other times, it may be possible to estimate a_{i}^{n %2B 1}(u)\, using an explicit method and maintain stability.

Example: 1D diffusion with advection for steady flow, with multiple channel connections

This is a solution usually employed for many purposes when there's a contamination problem in streams or rivers under steady flow conditions but information is given in one dimension only. Often the problem can be simplified into a 1-dimensional problem and still yield useful information.

Here we model the concentration of a solute contaminant in water. This problem is composed of three parts: the known diffusion equation (D_x chosen as constant), an advective component (which means the system is evolving in space due to a velocity field), which we choose to be a constant Ux, and a lateral interaction between longitudinal channels (k).

\langle 0 \rangle\frac{\partial C}{\partial t} = D_x \frac{\partial^2 C}{\partial x^2} - U_x \frac{\partial C}{\partial x}- k (C-C_N)-k(C-C_M)

where C is the concentration of the contaminant and subscripts N and M correspond to previous and next channel.

The Crank–Nicolson method (where i represents position and j time) transforms each component of the PDE into the following:

\langle 1 \rangle \frac{\partial C}{\partial t} = \frac{C_{i}^{j %2B 1} - C_{i}^{j}}{\Delta t}
\langle 2 \rangle\frac{\partial^2 C}{\partial x^2}= \frac{1}{2 (\Delta x)^2}\left(
(C_{i %2B 1}^{j %2B 1} - 2 C_{i}^{j %2B 1} %2B C_{i - 1}^{j %2B 1}) %2B 
(C_{i %2B 1}^{j} - 2 C_{i}^{j} %2B C_{i - 1}^{j})
\right)
\langle 3 \rangle\frac{\partial C}{\partial x} = \frac{1}{2}\left(
\frac{(C_{i %2B 1}^{j %2B 1} - C_{i - 1}^{j %2B 1})}{2 (\Delta x)} %2B 
 \frac{(C_{i %2B 1}^{j} - C_{i - 1}^{j})}{2 (\Delta x)}
\right)
 \langle 4 \rangle C= \frac{1}{2} (C_{i}^{j%2B1} %2B C_{i}^{j})
 \langle 5 \rangle C_N= \frac{1}{2} (C_{Ni}^{j%2B1} %2B C_{Ni}^{j})
 \langle 6 \rangle C_M= \frac{1}{2} (C_{Mi}^{j%2B1} %2B C_{Mi}^{j})

Now we create the following constants to simplify the algebra:

 \lambda= \frac{D_x\Delta t}{2 \Delta x^2}
 \alpha= \frac{U_x\Delta t}{4 \Delta x}
 \beta= \frac{k\Delta t}{2}

and substitute <1>, <2>, <3>, <4>, <5>, <6>, α, β and λ into <0>. We then put the new time terms on the left (j + 1) and the present time terms on the right (j) to get:

 -\beta C_{Ni}^{j%2B1}-(\lambda%2B\alpha)C_{i-1}^{j%2B1} %2B(1%2B2\lambda%2B2\beta)C_{i}^{j%2B1}-(\lambda-\alpha)C_{i%2B1}^{j%2B1}-\beta C_{Mi}^{j%2B1} = \beta C_{Ni}^{j}%2B(\lambda%2B\alpha)C_{i-1}^{j} %2B(1-2\lambda-2\beta)C_{i}^{j}%2B(\lambda-\alpha)C_{i%2B1}^{j}%2B\beta C_{Mi}^{j}

To model the first channel, we realize that it can only be in contact with the following channel (M), so the expression is simplified to:

 -(\lambda%2B\alpha)C_{i-1}^{j%2B1} %2B(1%2B2\lambda%2B\beta)C_{i}^{j%2B1}-(\lambda-\alpha)C_{i%2B1}^{j%2B1}-\beta C_{Mi}^{j%2B1} = %2B(\lambda%2B\alpha)C_{i-1}^{j} %2B(1-2\lambda-\beta)C_{i}^{j}%2B(\lambda-\alpha)C_{i%2B1}^{j}%2B\beta C_{Mi}^{j}

In the same way, to model the last channel, we realize that it can only be in contact with the previous channel (N), so the expression is simplified to:

 -\beta C_{Ni}^{j%2B1}-(\lambda%2B\alpha)C_{i-1}^{j%2B1} %2B(1%2B2\lambda%2B\beta)C_{i}^{j%2B1}-(\lambda-\alpha)C_{i%2B1}^{j%2B1}= \beta C_{Ni}^{j}%2B(\lambda%2B\alpha)C_{i-1}^{j} %2B(1-2\lambda-\beta)C_{i}^{j}%2B(\lambda-\alpha)C_{i%2B1}^{j}

To solve this linear system of equations we must now see that boundary conditions must be given first to the beginning of the channels:

 C_0^{j}: initial condition for the channel at present time step
 C_{0}^{j%2B1}: initial condition for the channel at next time step
 C_{N0}^{j}: initial condition for the previous channel to the one analyzed at present time step
 C_{M0}^{j}: initial condition for the next channel to the one analyzed at present time step

For the last cell of the channels (z) the most convenient condition becomes an adiabatic one, so

\frac{\partial C}{\partial x}_{x=z} = 
\frac{(C_{i %2B 1} - C_{i - 1})}{2 \Delta x}  = 0

This condition is satisfied if and only if (regardless of a null value)

 C_{i %2B 1}^{j%2B1} = C_{i - 1}^{j%2B1} \,

Let us solve this problem (in a matrix form) for the case of 3 channels and 5 nodes (including the initial boundary condition). We express this as a linear system problem:

 \begin{bmatrix}AA\end{bmatrix}\begin{bmatrix}C^{j%2B1}\end{bmatrix}=[BB][C^{j}]%2B[d]

where

\mathbf{C^{j%2B1}} = \begin{bmatrix}
C_{11}^{j%2B1}\\ C_{12}^{j%2B1} \\ C_{13}^{j%2B1} \\ C_{14}^{j%2B1} 
\\ C_{21}^{j%2B1}\\ C_{22}^{j%2B1} \\ C_{23}^{j%2B1} \\ C_{24}^{j%2B1} 
\\ C_{31}^{j%2B1}\\ C_{32}^{j%2B1} \\ C_{33}^{j%2B1} \\ C_{34}^{j%2B1} 
\end{bmatrix}   and   \mathbf{C^{j}} = \begin{bmatrix}
C_{11}^{j}\\ C_{12}^{j} \\ C_{13}^{j} \\ C_{14}^{j} 
\\ C_{21}^{j}\\ C_{22}^{j} \\ C_{23}^{j} \\ C_{24}^{j} 
\\ C_{31}^{j}\\ C_{32}^{j} \\ C_{33}^{j} \\ C_{34}^{j} 
\end{bmatrix}

Now we must realize that AA and BB should be arrays made of four different subarrays (remember that only three channels are considered for this example but it covers the main part discussed above).

\mathbf{AA} = \begin{bmatrix}
AA1 & AA3 & 0\\
AA3 & AA2 & AA3\\
0 & AA3 & AA1\end{bmatrix}   and  
\mathbf{BB} = \begin{bmatrix}
BB1 & -AA3 & 0\\
-AA3 & BB2 & -AA3\\
0 & -AA3 & BB1\end{bmatrix}  

where the elements mentioned above correspond to the next arrays and an additional 4x4 full of zeros. Please note that the sizes of AA and BB are 12x12:

\mathbf{AA1} = \begin{bmatrix}
(1%2B2\lambda%2B\beta) & -(\lambda-\alpha) & 0 & 0 \\
-(\lambda%2B\alpha) & (1%2B2\lambda%2B\beta) & -(\lambda-\alpha) & 0 \\
0 & -(\lambda%2B\alpha) & (1%2B2\lambda%2B\beta) & -(\lambda-\alpha)\\
0 & 0 & -2\lambda & (1%2B2\lambda%2B\beta)\end{bmatrix}   , 
\mathbf{AA2} = \begin{bmatrix}
(1%2B2\lambda%2B2\beta) & -(\lambda-\alpha) & 0 & 0 \\
-(\lambda%2B\alpha) & (1%2B2\lambda%2B2\beta) & -(\lambda-\alpha) & 0 \\
0 & -(\lambda%2B\alpha) & (1%2B2\lambda%2B2\beta) & -(\lambda-\alpha)\\
0 & 0 & -2\lambda & (1%2B2\lambda%2B2\beta) \end{bmatrix}   , 
\mathbf{AA3} = \begin{bmatrix}
-\beta & 0 & 0 & 0 \\
0 & -\beta & 0 & 0 \\
0 & 0 & -\beta & 0 \\
0 & 0 & 0 & -\beta \end{bmatrix}   , 
\mathbf{BB1} = \begin{bmatrix}
(1-2\lambda-\beta) & (\lambda-\alpha) & 0 & 0 \\
(\lambda%2B\alpha) & (1-2\lambda-\beta) & (\lambda-\alpha) & 0 \\
0 & (\lambda%2B\alpha) & (1-2\lambda-\beta) & (\lambda-\alpha)\\
0 & 0 & 2\lambda & (1-2\lambda-\beta)\end{bmatrix}   &  
\mathbf{BB2} = \begin{bmatrix}
(1-2\lambda-2\beta) & (\lambda-\alpha) & 0 & 0 \\
(\lambda%2B\alpha) & (1-2\lambda-2\beta) & (\lambda-\alpha) & 0 \\
0 & (\lambda%2B\alpha) & (1-2\lambda-2\beta) & (\lambda-\alpha)\\
0 & 0 & 2\lambda & (1-2\lambda-2\beta) \end{bmatrix}

The d vector here is used to hold the boundary conditions. In this example it is a 12x1 vector:

\mathbf{d} = \begin{bmatrix}
(\lambda%2B\alpha)(C_{10}^{j%2B1}%2BC_{10}^{j}) \\ 0 \\ 0 \\ 0 \\ (\lambda%2B\alpha)(C_{20}^{j%2B1}%2BC_{20}^{j}) \\ 0 \\ 0 \\ 0 \\ (\lambda%2B\alpha)(C_{30}^{j%2B1}%2BC_{30}^{j}) \\
0\\
0\\
0\end{bmatrix}

To find the concentration at any time, one must iterate the following equation:

 \begin{bmatrix}C^{j%2B1}\end{bmatrix}=\begin{bmatrix}AA^{-1}\end{bmatrix}([BB][C^{j}]%2B[d])

Example: 2D diffusion

When extending into two dimensions on a uniform Cartesian grid, the derivation is similar and the results may lead to a system of band-diagonal equations rather than tridiagonal ones. The two-dimensional heat equation

\frac{\partial u}{\partial t} = a \left(\frac{\partial^2 u}{\partial x^2} %2B \frac{\partial^2 u}{\partial y^2}\right)

can be solved with the Crank–Nicolson discretization of

\begin{align}u_{i,j}^{n%2B1} &= u_{i,j}^n %2B \frac{1}{2} \frac{a \Delta t}{(\Delta x)^2} \big[(u_{i%2B1,j}^{n%2B1} %2B u_{i-1,j}^{n%2B1} %2B u_{i,j%2B1}^{n%2B1} %2B u_{i,j-1}^{n%2B1} - 4u_{i,j}^{n%2B1}) \\ & \qquad {} %2B (u_{i%2B1,j}^{n} %2B u_{i-1,j}^{n} %2B u_{i,j%2B1}^{n} %2B u_{i,j-1}^{n} - 4u_{i,j}^{n})\big]\end{align}

assuming that a square grid is used so that \Delta x = \Delta y. This equation can be simplified somewhat by rearranging terms and using the CFL number

\mu = \frac{a \Delta t}{(\Delta x)^2}

For the Crank–Nicolson numerical scheme, a low CFL number is not required for stability, however it is required for numerical accuracy. We can now write the scheme as:

\begin{align}&(1 %2B 2\mu)u_{i,j}^{n%2B1} - \frac{\mu}{2}\left(u_{i%2B1,j}^{n%2B1} %2B u_{i-1,j}^{n%2B1} %2B u_{i,j%2B1}^{n%2B1} %2B u_{i,j-1}^{n%2B1}\right) \\ & \quad = (1 - 2\mu)u_{i,j}^{n} %2B \frac{\mu}{2}\left(u_{i%2B1,j}^{n} %2B u_{i-1,j}^{n} %2B u_{i,j%2B1}^{n} %2B u_{i,j-1}^{n}\right)\end{align}

Application in financial mathematics

Because a number of other phenomena can be modeled with the heat equation (often called the diffusion equation in financial mathematics), the Crank–Nicolson method has been applied to those areas as well.[4] Particularly, the Black–Scholes option pricing model's differential equation can be transformed into the heat equation, and thus numerical solutions for option pricing can be obtained with the Crank–Nicolson method.

The importance of this for finance, is that option pricing problems, when extended beyond the standard assumptions (e.g. incorporating changing dividends), cannot be solved in closed form, but can be solved using this method. Note however, that for non-smooth final conditions (which happen for most financial instruments), the Crank–Nicolson method is not satisfactory as numerical oscillations are not damped. For vanilla options, this results in oscillation in the gamma value around the strike price. Therefore, special damping initialization steps are necessary (e.g., fully implicit finite difference method).

See also

References

  1. ^ Tuncer Cebeci (2002). Convective Heat Transfer. Springer. ISBN 0966846141. http://books.google.com/?id=xfkgT9Fd4t4C&pg=PA257&dq=%22Crank-Nicolson+method%22. 
  2. ^ Crank, J.; Nicolson, P. (1947). "A practical method for numerical evaluation of solutions of partial differential equations of the heat conduction type". Proc. Camb. Phil. Soc. 43 (1): 50–67. doi:10.1007/BF02127704. .
  3. ^ Thomas, J. W. (1995). Numerical Partial Differential Equations: Finite Difference Methods. Texts in Applied Mathematics. 22. Berlin, New York: Springer-Verlag. ISBN 978-0-387-97999-1. . Example 3.3.2 shows that Crank–Nicolson is unconditionally stable when applied to u_t=au_{xx}.
  4. ^ Wilmott, P.; Howison, S.; Dewynne, J. (1995). The Mathematics of Financial Derivatives: A Student Introduction. Cambridge Univ. Press. ISBN 0521497892. http://books.google.co.in/books?hl=en&q=The%20Mathematics%20of%20Financial%20Derivatives%20Wilmott&um=1&ie=UTF-8&sa=N&tab=wp. 

External links